Towards Disambiguating Web Tables
Abstract
Web tables are a rich source of factual information. However, without semantic annotation of the tables' content, this information cannot be used for automatic integration and search. We propose a methodology to annotate table headers with semantic type information based on the content of each column's cells. In our experiments on 50 tables we achieved an F1 value of 0.55, with accuracy varying greatly depending on the ontology used. Moreover, we found that, on average, only 20 cells (37%) need to be considered to reach 94% of the maximal F1 score. These results suggest that, for table disambiguation, the choice of ontology needs to be considered and the input data size can be reduced.
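The abstract does not spell out the annotation procedure, so the following is only a minimal sketch of the general idea: assign a column a semantic type by voting over the candidate types of a limited sample of its cells. The lookup table `TOY_ONTOLOGY`, the function name `annotate_column`, and the majority-vote rule are all assumptions made for illustration, not the authors' method. Only the default sample size of 20 echoes the figure reported above.

```python
from collections import Counter
from typing import List, Optional

# Hypothetical toy "ontology": maps a normalized cell value to candidate types.
# A real system would query an actual ontology or knowledge base instead.
TOY_ONTOLOGY = {
    "berlin": ["City"],
    "paris": ["City", "Person"],
    "hamburg": ["City"],
    "germany": ["Country"],
}


def annotate_column(cells: List[str], sample_size: int = 20) -> Optional[str]:
    """Return the most frequent candidate type among the first `sample_size` cells.

    Returns None when no sampled cell matches the toy ontology.
    """
    votes: Counter = Counter()
    for cell in cells[:sample_size]:
        # Each candidate type of a matching cell contributes one vote.
        for candidate in TOY_ONTOLOGY.get(cell.strip().lower(), []):
            votes[candidate] += 1
    if not votes:
        return None
    return votes.most_common(1)[0][0]
```

Calling `annotate_column(["Berlin", "Paris", "Hamburg"])` gives three votes to `City` and one to `Person`, so the column is labeled `City`. Restricting `sample_size` shows how a smaller input can change or preserve the outcome, which is the kind of trade-off the abstract reports.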
Similar resources
Disambiguating Web Tables using Partial Data
This work addresses disambiguating Web tables, that is, annotating content cells with named entities and table columns with semantic type information. Contrary to the state of the art, which builds features based on the entire table content, this work uses a method that starts by annotating table columns using automatically selected partial data (i.e., a sample), then using the type information to guide conten...
Acquiring Comparative Commonsense Knowledge from the Web
Applications are increasingly expected to make smart decisions based on what humans consider basic commonsense. An often overlooked but essential form of commonsense involves comparisons, e.g. the fact that bears are typically more dangerous than dogs, that tables are heavier than chairs, or that ice is colder than water. In this paper, we first rely on open information extraction methods to ob...
Finnish National Ontologies for the Semantic Web - Towards a Content and Service Infrastructure
We present a national ontology development and service framework being developed in Finland in 2003-2007. The framework is based on a set of related core ontologies, most notably on a national upper ontology based on the commonly used Finnish General Thesaurus YSA maintained by the National Library of Finland. The framework implements three ontology services by a web-based system ONKI. Firstly,...
A Tool for Creating and Visualizing Semantic Annotations on Relational Tables
Semantically annotating content from relational tables on the Web is a crucial task towards realizing the vision of the Semantic Web. However, there is a lack of open source, user-friendly tools to facilitate this. This paper describes an extension of the TableMiner system, an open source Semantic Table Interpretation system that automatically annotates Web tables using Linked Data in an effect...
Updating Wikipedia via DBpedia Mappings and SPARQL
DBpedia crystallized most of the concepts of the Semantic Web using simple mappings to convert Wikipedia articles (i.e., infoboxes and tables) to RDF data. This “semantic view” of wiki content has rapidly become the focal point of the Linked Open Data cloud, but its impact on the original Wikipedia source is limited. In particular, little attention has been paid to the benefits that the semanti...
Publication date: 2013